A high-performance image processing pipeline for Polony DNA re-sequencing
نویسنده
چکیده
DNA Sequencing and re-sequencing are two fundamental tools in biological research, and applied to numerous such as genome mapping and genetic diseases related to DNA mutation. Polony DNA re-sequencing [POR06] is a modern high-throughput technique, which has been implemented in numerous systems, and the baseline software this project is based on is the image processing pipeline originally written for the Polonator G.007 machine [POL], which gathers relatively low-resolution images and extracts the relevant data to perform DNA re-sequencing operations. Due to the ongoing quest towards the cost reduction of re-sequencing large amounts of DNA material, for example the human genome, a new high-throughput parallel image processing pipeline has been designed to support a different selection of algorithms and to exploit SMP parallel computing systems, using a combination of pipeline, data-parallel and task-based parallelism patterns. The new software outperforms the original implementation in terms of total running time on a reference SMP system, while operating on 5.5x higher resolution images, as a result of both pipelining of the different processing stages on different processors and assigning multiple processors to the most computationally intensive stages, while overall load balancing amongst the stages is managed by a task-based task-stealing scheduler.
منابع مشابه
ضربکننده و ضربجمعکننده پیمانه 2n+1 برای پردازنده سیگنال دیجیتال
Nowadays, digital signal processors (DSPs) are appropriate choices for real-time image and video processing in embedded multimedia applications not only due to their superior signal processing performance, but also of the high levels of integration and very low-power consumption. Filtering which consists of multiple addition and multiplication operations, is one of the most fundamental operatio...
متن کاملHigh Performance Implementation of Fuzzy C-Means and Watershed Algorithms for MRI Segmentation
Image segmentation is one of the most common steps in digital image processing. The area many image segmentation algorithms (e.g., thresholding, edge detection, and region growing) employed for classifying a digital image into different segments. In this connection, finding a suitable algorithm for medical image segmentation is a challenging task due to mainly the noise, low contrast, and steep...
متن کاملHigh Performance Implementation of Fuzzy C-Means and Watershed Algorithms for MRI Segmentation
Image segmentation is one of the most common steps in digital image processing. The area many image segmentation algorithms (e.g., thresholding, edge detection, and region growing) employed for classifying a digital image into different segments. In this connection, finding a suitable algorithm for medical image segmentation is a challenging task due to mainly the noise, low contrast, and steep...
متن کاملFluorescent in situ sequencing on polymerase colonies.
Integration of DNA isolation, amplification, and sequencing can be achieved by the use of polymerase colonies (polonies) and cycles of fluorescent dNTP incorporation. In this paper, we present four advances that bring us closer to sequencing genomes cost-effectively using the polony technology. First, a polymerase trapping technique enables efficient nucleotide extension by DNA polymerase in a ...
متن کاملDDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data
High-performance next-generation sequencing (NGS) technologies are advancing genomics and molecular biological research. However, the immense amount of sequence data requires computational skills and suitable hardware resources that are a challenge to molecular biologists. The DNA Data Bank of Japan (DDBJ) of the National Institute of Genetics (NIG) has initiated a cloud computing-based analyti...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012